Corpus: lmo_wikipedia_2014


2.2.5 Most frequent word beginnings

The most frequent word beginnings, given as character n-grams for N = 1 to 5, together with a Zipf diagram of their frequencies
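The counting behind tables of this kind can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used to produce this page: the function names and the toy word list are hypothetical, words shorter than N are skipped, and the page does not say whether its frequencies count word types or running tokens.

```python
from collections import Counter

def prefix_counts(words, n):
    """Count the first n characters of every word that has at least n characters.

    Assumption: shorter words are skipped; the original page may treat them differently.
    """
    return Counter(w[:n] for w in words if len(w) >= n)

def top_prefixes(words, n, k=5):
    """Return up to k rows of (rank, frequency, n-gram), most frequent first."""
    ranked = prefix_counts(words, n).most_common(k)
    # The trailing hyphen mirrors the notation used in the tables (e.g. "cun-").
    return [(rank, freq, gram + "-")
            for rank, (gram, freq) in enumerate(ranked, start=1)]

# Hypothetical toy word list, not taken from the corpus.
words = ["cunt", "cuntra", "cun", "con", "l'aqua"]
for n in range(1, 4):
    for row in top_prefixes(words, n):
        print(n, *row)
```

The apostrophe is treated as an ordinary character here, which matches entries such as l'- and l'a- in the tables.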


[Figure: Zipf's diagram for word beginnings (Gnuplot diagram)]
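A Zipf diagram plots frequency against rank on logarithmic axes; if frequency falls off roughly as a power of rank, the points lie close to a straight line with negative slope. The sketch below shows one way to prepare such points and estimate the slope by ordinary least squares. The function names are hypothetical, and nothing here reproduces how the diagram on this page was generated.

```python
import math

def zipf_points(frequencies):
    """Return (log10 rank, log10 frequency) pairs, most frequent item first."""
    ordered = sorted(frequencies, reverse=True)
    return [(math.log10(rank), math.log10(freq))
            for rank, freq in enumerate(ordered, start=1) if freq > 0]

def fitted_slope(points):
    """Least-squares slope of the log-log points (requires at least two points)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx

# Synthetic frequencies proportional to 1/rank, used only to show the idea.
synthetic = [1000 / rank for rank in range(1, 11)]
print(fitted_slope(zipf_points(synthetic)))
```

A meaningful fit needs the full frequency list; the five rows shown per table are too few to estimate a slope.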

Top Characters
word rank frequency n-gram
1 6934 s-
2 6922 c-
3 5823 p-
4 4921 d-
5 4635 l-
Top Character Bigrams
word rank frequency n-gram
1 2662 l'-
2 1692 cu-
3 1664 co-
4 1599 pr-
5 1420 in-
Top Character Trigrams
word rank frequency n-gram
1 664 cun-
2 659 con-
3 655 l'a-
4 509 pre-
5 479 par-
Top Character 4-Grams
word rank frequency n-gram
1 196 l'in-
2 196 inte-
3 193 cunt-
4 184 cump-
5 171 cuns-
Top Character 5-Grams
word rank frequency n-gram
1 129 inter-
2 79 parti-
3 74 contr-
4 66 Caste-
5 61 cuntr-
1590 msec needed at 2017-10-27 02:49